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Abstract 

We present an algorithm that enables one to perform locally adaptive block thresh- 
olding, while maintaining image continuity. Images are divided into sub-images 
based some standard image attributes and thresholding technique is employed over 
the sub- images. The present algorithm makes use of the thresholds of neighboring 
sub-images to calculate a range of values. The image continuity is taken care by 
choosing the threshold of the sub-image under consideration to lie within the above 
range. After examining the average range values for various sub-image sizes of a 
variety of images, it was found that the range of acceptable threshold values is 
substantially high, justifying our assumption of exploiting the freedom of range for 
bringing out local details. 

Key words: Block Thresholding; Boundary Mismatch; Image Continuity; Image 
Variance 



1 Introduction 



Applications like document image analysis [Dawoud and Kamel (2001)], qual- 
ity inspection of materials, non-destructive testing [Sezgen and Sankur (2001)] 
etc., require the concerned images to be thresholded. Numerous methods to 
perform image thresholding exist in the literature [Trier and Jain (1995), 
Sezgen and Sakur (2004), Sahoo and Soltani (1998), Huang et al. (2005)]. 
Thresholding algorithms can be classified into the following main categories: 
Histogram shape [Rosennfeld and Torre (1983), Sezan (1985), Ramesh et al. 
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(1995), Wang et al. (2002)], where the aim of the algorithm is to find an optimal 
threshold that separates two major peaks in the histogram, is implemented 
by sending a smoothing filter on the histogram and then a difference filter 
or by fitting the histogram with two Gaussians. But the main disadvantage 
of histogram-based methods is their disregard of spatial information. Image 
entropy based methods [Pun (1980), Kapur et al.(1985), Li and Lee(1993),Li 
and Tarn (1998)] use the entropy of the image as a constraint for threshold 
selection. The two common ways in which this can be done is by the maxi- 
mization of entropy of the thresholded image or minimization of cross entropy 
between input image and the output binary image. 

General image attributes [Tsai (1985), Hertz and Schafer (1998), Gorman 
(1994), Arora et al. (2005)] can also be effectively used, where the threshold is 
selected based on some similarity measure between the original image and the 
binarized version of the image. These can take the form of edges, shapes, color 
or other suitable attributes like compactness or connectivity of the objects, 
resulting from the binarization process or the coincidence of edge fields. But 
the disadvantage of the above method lies in its complexity and the relatively 
low image quality. 

Clustering of gray level [Ridler and Calward (1978), Leung and Lam (1996), 
Kittler and Illingworth (1986), Pal and Pal (1989)], based methods aim to 
find two clusters in pixel distribution, a foreground cluster and a background 
cluster. Various algorithms exist for finding these clusters. Spatial informa- 
tion [Abutaleb (1989)] utilize information of objects and background pixels in 
the form of context probabilities, correlation functions, co-occurrence prob- 
abilities, local linear dependence models of pixels, two dimensional entropy 
etc. 

Locally adaptive thresholding based methods [White and Rohrer (1983), Niblack 
(1986), Lindquist (1999), Trier and Taxt (1995)], are characterized by calcula- 
tion of threshold at every pixel. The value of the threshold depends upon some 
local parameters like mean, variance, and surface fitting parameters or their 
suitable combinations. In one approach, the gray value of the pixel is com- 
pared with the average of the gray values in some neighborhood; if the pixel is 
significantly larger than the average, it is assigned as foreground, otherwise it 
is classified as background. Another common method adapts the thresholding 
according to local mean (/x) and standard deviation (a) over the window size. 
The threshold at every pixel (z, j) is calculated as T(i,j) = + k.s(i,j), 

for a suitable value of k. Niblack's method for thresholding is a well-known 
example of this class. The calculation of threshold at every pixel makes this 
technique relatively time consuming. 

In approaches based on global thresholds, which are faster as compared to 
their local counterparts, one calculates a single threshold value for the entire 
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image. A common example of this class is Otsu's (1979) method of threshold- 
ing; this is an iterative approach, which assumes that the gray level histogram 
is the sum of two normal intensity distributions. Since the thresholding is done 
once for the whole image, one may lose certain local characteristics. Hence, the 
thresholding of images based on local attributes have proved to be generally 
superior to the global thresholding methods in terms of final image quality. A 
number of the above thresholding methods suffer from the problem of image 
continuity, which cannot be tolerated in applications pertaining to medical 
imaging, remote sensing, optical character recognition etc., where image con- 
tinuity plays a crucial role. 

The usual method of calculation of local threshold for every pixel, with the 
help of information present in a window defined around it, is computationally 
intensive. In this paper, we present a hybrid method, where the threshold is 
calculated only once in a window. This locally adaptive block thresholding 
(LABT) algorithm makes use of the threshold values of the neighboring sub- 
images to calculate a range. Image continuity is obtained by choosing the 
threshold value of the sub-image under consideration to lie within the range 
of values specified by the algorithm. 

The above algorithm is applied to a wide variety of images and it is observed 
that the local details are preserved to a great extent. In addition to that, the 
algorithm fared better in terms of time-complexity as compared to the other 
thresholding techniques. 

Before proceeding to the details of the technique, it is convenient to define the 
following notations. In this text, S mjTl denotes a sub- image, where m and n 
denote the position of the sub-image in the matrix of sub-images. Threshold 
chosen for the sub-image S m>n on application of an appropriate thresholding 
technique, is denoted by OT m ^ n (Original Threshold). Threshold value of sub- 
image SVrt.n after application of the present algorithm, is denoted by T m n . The 
range of threshold values that S mjTl can take, without violating its continuity 
with the upper (left) sub-image, is denoted by UR mtn (Li? m ri ). 



2 Procedure For Local Block Thresholding With Continuity Con- 
straint 

The given image needs to be divided into a number of sub-images of size mxn, 
where the values of m and n can be chosen on the basis of standard image 
attributes. In this paper image variance has been chosen as that attribute. 
The image having larger variance is divided into more number of sub-images 
in order to bring out finer details, whereas an image with a lower variance is 
divided into low number of sub-images in order to be computationally inex- 
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pensive. The reason to divide a given image into sub-images based on some 
attribute (image variance in this case), is to balance between image quality 
and time complexity by choosing the right sub-image size depending upon 
the application under consideration and level of finer details to be extracted 
from the image. In a number of cases e.g., optical character recognition, the 
sub-image size is dictated by the image under consideration. The sub-image 
size can also be left as a variable, to be determined by the desired amount of 
image details. This may have usefulness to medical imaging. The number of 
rows and columns of the image are then converted to multiples of m and n 
respectively. 

Once the division of image has been done, the sub-images are scanned from 
top-left to bottom-right. Constraint is then imposed on the threshold selection 
of a sub-image by thresholds of upper and left sub-images, i.e., continuity is 
sought between S m>n and 5 m _i, n , and S m , n and S m , n -i- 

Any threshold determination technique can be used to binarize the sub-image 
S m>n , starting from Si^. The thresholds T m _ l n and T m n _\ of the neighboring 
sub-images are used to impose constraint of continuity on the threshold T mi „ 
of the sub-image S m ^ n . The choice of threshold T m>n of S m ^ n is constrained 
to a range R m ,n- This range is determined using the threshold values of the 
neighboring sub-images and the bordering pixel values of the sub-image under 
consideration. Any value in the range R m , n when used to threshold the columns 
(rows) of S rntn , that borders the adjacent sub-image S m -\. n (SVra.n-i), classify 
them into foreground or background, in the same way T m _i 5 „ (T mi „_i) clas- 
sifies the pixels of those borders. Stating in symbolic terms, if rT m ^ n denotes 
thresholding operation, i.e., classifying every pixel of a given array/matrix into 
foreground or background, using T m n , the constraint is then stated as: 

rT m , n {S m . n {outer lining)) = TT m _ 1:Jl (S m:n (outer lining)) (1) 

if the outer lining is a row of pixels and, 

TT m . n (S mtn {outer lining)) = TT rntn ^i(S rn . n {outer lining)) (2) 

if the outer lining is a column of pixels 

The range of threshold values R m , n , that a sub- image can take while main- 
taining image continuity with the upper and side block, is determined in the 
following manner. An array comprising of threshold Tm-l,n of the upper sub- 
image and pixel values of R m ,nS uppermost row (say, X), which borders the 
upper sub-image with maximum and minimum pixel values, is created. Pixel 
values in X , which are equal to T m ra , are deleted before X is added to the ar- 
ray in order get values other than T m _ l n . Appending minimum and maximum 
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pixel values ensures the presence of values, greater and less than T m _i ;n . Let 
the values that are immediately lower and greater than T m „ l n in the array, 
be Rn and Rhi, respectively. Then the range dictated by the upper sub-image 
is 

UR m ,n = [Rn+i, Rhi}- 

The classification of foreground and background pixels is done, assuming the 
definition of thresholding as X < threshold => X = background, and X > 
threshold => X = foreground. 

The same procedure is applied to determine the range LR m n dictated by the 
left sub-image. Here, X is the column of S m) „ which borders the left sub-image 
and the threshold to be added to the array is T m n _i .The effective range, 
within which the threshold of S mjn , has to be selected to avoid discontinuity, 
is 

Rm,n U Rm,n l~l LR m n . 



3 Algorithm 

The salient features of the proposed LABT algorithm can be stated in the 
following manner: 

1) The image is divided into number of same-sized rectangular sub-images 
based on the variance of the whole image. Other attributes of the image can 
also be used for this purpose. 

2) Image is then made into a multiple of the sub- image, by a suitable operation. 

3) Starting from S^i, operations are performed on the sub-images row-wise, 
i.e., the image is scanned from top-left to bottom right. 

4) An original threshold OT m)Tl of S mjn , is determined, using a suitable thresh- 
olding technique. 

5) The range R m ,n for the sub-window under consideration is worked out, 
using T m _i 5 „ and T m>n _i with the help of above method. 

6) For S^i, the threshold T^i = OT\ \. For the sub-images in the topmost 
row (leftmost column), continuity is maintained only with the left (upper) 
sub-images. 
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(a) (b) (c) 

Fig. 1. Plots showing dependence of three important characteristics on sub image 
size (averaged over 35 images) 

7) In case OT m n falls out of R m ,n, it is brought to the nearest extreme of -R m ,„, 
using the above specified procedure, and denoted by T m n . 

The above algorithm thus ensures that continuity is maintained across sub- 
images. Sub- image size can be changed depending upon the purpose, i.e., a 
smaller sub-image size can be taken to bring out finer details, whenever it is 
necessary. Bigger size sub-images are advisable for document image thresh- 
olding, where fuzzy outlines of letters need to be made well defined. A bigger 
sub-image size will help in keeping the threshold almost constant, across let- 
ters, thereby providing a consistent cut-off for removing fuzziness. 



4 Results and Observations 



We observed reduction in the average size of the range R m ,n with increasing 
sub-image size. This can be seen from Fig. la. This reduction in size is because 
of the availability of elements nearer to T m ri in the bordering array X , when 
the sub-image size gets bigger. Since the algorithm starts from S± t i and prop- 
agates downwards, it is preferable to binarize S^i with threshold obtained by 
applying threshold over the whole image. 

The second plot (Fig. lb) shows the variation of the fraction of times threshold 
exceeds the range constraint; with sub-image size averaged over 35 images. 
The fraction of times 0T m ,„ falling outside R m ,n decreases with increase in 
sub-image size. This is due to stabilization of threshold across sub-images, 
when the sub-image size is increased. This is as expected, given the large- 
scale homogeneity present in numerous images. 

We are presenting a few images for which the familiar method of threshold- 
ing, i.e., area division of cumulative distribution function (ADCDF), is used 
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(d) (e) (f) 



Fig. 2. Comparison between the results of various methods, (a) Original image, 
(b) Thresholded using ADCDF, (c) Thresholded using Otsu, (d) Thresholded using 
Niblack, (e) Thresholded using ADCDF with LABT, (f) Thresholded using Otsu 
with LABT. 




(a) (b) 



Fig. 3. (a) Original text image, (b) Thresholded using LABT. 
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Imagel 


Image 2 


Image 3 


Image 4 


Image 5 




Sise 


ieo*ieo 


240*330 


330 * 263 


319*480 


512*512 




Time (sec) 


ADCDF 


0.37 


1.64 


1.75 


2.84 


4.70 


Otsu 


0.172 


1.280 


1.468 


2.422 


4. 170 


Niblack 


1.147 


1.937 


2.040 


2.390 


4.750 


LAET+ADCDF 


0.21 


0.43 


0.89 


1.29 


3.53 


LAET+Otsu 


0.20 


0.39 


0.76 


1.17 


3.14 




PSNR (dF) 


ADCDF 


1.62 


6.18 


2.22 


5.03 


2.77 


Otsu 


1.62 


8.91 


2.24 


5.39 


5.26 


Niblack 


1.62 


I486 


6.60 


10.15 


1059 


LAET+ADCDF 


3.18 


6.37 


5.19 


5.32 


8.85 


LAET+Otsu 


7.30 


6.36 


5.41 


5.82 


8.93 




Ho. of times OT,^ 
exaeeds R^n 


LAET+ADCDF 


8 


7 


14 


11 


15 


LAET+Otsu 


12 


6 


14 


11 


14 




Total no . of times 


640 


960 


1050 


1500 


2048 




Ho. of times 
Range does not 
oi^rlap 


LAET+ADCDF 


2 


2 


2 


1 


3 


LABT+Otsu 


3 


2 


1 


2 






Table 1 

Quantitative comparision of various thresholding methods 

to binarize the sub-images. We have also used Otsu's algorithm for the same 
purpose. Superior thresholding methods, when used in conjunction with this 
algorithm, will give far better results. For the purpose of illustrating the effi- 
cacy of our procedure and comparison, we have also presented the binarized 
images, using Otsu (global) and Niblack (local) thresholding methods in Fig. 
2. One clearly sees that the present locally adaptive block thresholding method 
clearly does well in terms of extracting local features as well as retaining the 
visual image quality. We have checked this property of LABT in a variety of 
images. 

For text images, thresholding followed by morphological operation like thin- 
ning gives good results (Fig. 3a & Fig 3b). It is advisable to choose the sub- 
image size to be more than the average object size in the image. This ensures 
the whole object, to be uniformly classified as background or foreground, and 
avoids classification of within-object variation. 

The computational time and PSNR for different thresholding techniques, im- 
plemented with and without the locally adaptive block thresholding (LABT), 
has been shown in table 1. It is quite obvious from the results that the standard 
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(c) (d) 

Fig. 4. (a) Original image, (b) Inverted threshold, (c) Thresholded image, and (d) 
Final image obtained after ORing. 

thresholding techniques fare better when applied in conjunction with our al- 
gorithm. The table also shows that the number of times threshold exceeds the 
range, and number of times the range dictated by upper and side sub-image 
does not overlap for three different thresholding algorithms are quite small. 
This justifies our assumption of exploiting the freedom of range for bringing 
out local details. 

To avoid possible errors arising from the scanning of the image, row-wise from 
top to bottom, one can scan the image in different ways and perform an ORing 
operation of the different images, as specified below: 

1) Image is thresholded in the usual way. 

2) Invert the original image upside down and threshold the image. Then invert 
it back to the original state. 

3) Invert the given image right side left and threshold the image. Then invert 
it back. 

4) ORing operation is carried out on the above images to get the resulting 
image, which is equivalent to scanning the image in different ways and ORing 
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them. Not just scanning row-wise from top to bottom. 

The results of the ORing operation thus give superior results as shown in Fig. 
4. One can see much clearer local details in the final image. 



5 Conclusion and Discussion 

In this paper, a new locally adaptive block thresholding method has been pro- 
posed, which acts as a hybrid between known local and global methods. It can 
also be used in conjunction with other methods of binarization to bring out 
details of an image. It should be emphasized that the same is accomplished 
without introducing too much of time complexity, an extremely desired at- 
tribute of any binarization scheme. The present algorithm has been designed 
to ensure that the transitions between sub-windows are maintained continu- 
ously. This maintains image continuity. 

The efficacy of the method has been demonstrated in the context of a vari- 
ety of images of different types. This procedure may also be useful when a 
variable window size is required. The portions of an image requiring detailed 
investigations may be divided into finer sub images, whereas other portions 
of the image can be divided into bigger box sizes. In this case, one needs to 
explore the problem of boundary mismatch and continuity more carefully. The 
boundary mismatch can be possibly taken care by pushing the boundary of 
the block that created the mismatch, till the selected threshold falls within 
the range. This problem is currently under investigation and will be reported 
elsewhere. 
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